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© Expression of g-csf and mutelns thereof and their use. 

© Human recombinant G-CSF (hr-G-CSF), an mRNA frac- 
tion encoding hr-G-CSF, a DNA sequence encoding hr-G-CSF, 
modified DNA sequences encoding hr-G-CSF, that are ex- 
pressed at high levels in recombinant hosts are disclosed. 
Modifications to the 5' DNA sequence encoding mature mG- 
CSF are also disclosed. The modifications to the third posi- 
tion nucleotide of at least one in the 5' region codon of the 
DNA sequence of G-CSF result in enhanced expression of 
mG-CSF protein in recombinant host cells, without changing 
the amino acid sequence of the protein. Other modifications 
yield muteins of G-CSF in which the NH 2 -terminal amino acid 
is not methionine when produced from a recombinant host 
cell. Expression vectors carrying the altered mG-CSF DNA 
sequence are disclosed and claimed. 
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EXPRESSION OF G-CSF AND MUTEINS THEREOF AND THEIR USE 

The Invention concerns the field of recombinant DNA. In 
particular, the Invention concerns the Isolation of nucleic acid 
sequences encoding granulocyte colony-stimulating factor (G-CSF) and 
5 the expression of G-CSF 1n various host cells. The invention concerns 
modified DNA sequences encoding G-CSF and mutelns thereof that are 
efficiently expressed under the control of promoters operable 1n E. 
col 1, expression vectors utilizing the modified DNA sequences for the 
efficient expression of G-CSF in E. coli and Gram-negative hosts known 
10 to exchange genetic material with E. coli , host organisms Including 
microorganisms transformed with the expression vectors and G-CSF 
produced thereby. 

Interest 1n the growth and differentiation of granulocytes 
and macrophages from bone marrow progenitors has prompted the study of 

15 the colony-stimulating factors (CSFs) that regulate these processes. 
The subject has been recently reviewed by D. Metcalf in Blood, 67:257 
(1986). One of these factors, termed granulocyte-CSF (G-CSF) 
stimulates the in vitro growth of predominantly granulocyte colonies 
from bone marrow stem cells. The availability of cell lines that 

20 produce CSFs has been Instrumental 1n the characterization of G-CSF; 
however, proper interpretation of this work requires the precise 
identification of the CSF released by these cells. One such cell 
line, the MIA PaCa-2, derived from a human pancreatic carcinoma, 
reportedly produces both the macrophage-speclflc CSF-1 and a CSF that 

25 can Induce the formation of primarily granulocyte colonies in seven 
day bone marrow cell cultures. Wu, M. C, J. Clin. Invest., 67; 1588 
(1981). A CSF-1 cDNA clone has been recently isolated from this cell 
line, Kawasaki et a!., Science, 230 :291 (1985). They also showed, by 
translation of sucrose gradient fractionated mRNA In Xenopus laevis 

30 oocytes, that a transcript that sedlmented slower than the 18s marker 
encoded a CSF that Induced the proliferation of murine bone marrow 
cells; however, the CSF-1 clone hybridized most strongly to mRNA in 
the fractions that sedlmented faster than the 18s marker. 
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Human G-CSF causes the differentiation of granulocytes and 
monocytes from bone marrow progenitor cells. It also Induces some 
leukemia cells to differentiate Into more mature cell types. Thus, 
human G-CSF may be useful 1n regeneration of Important blood cell 
5 populations 1n Immunologically compromised patients and patients with 
anemic disorders or infections. G-CSF may also be useful in 
inhibiting leukemia by causing leukemia cells to differentiate to a 
non-dividing state, thereby checking the uncontrolled proliferation of 
these cell types. Myeloid leukemia cells are known to have receptors 

10 for G-CSF, for example. Science, 323 :7 (1986). 

Souza et al . have isolated, cloned, and expressed a gene 
encoding G-CSF, which has no significant homology to GM-CSF, in an 
unidentified strain of E. coll using a modified temperature sensitive 
runaway plasmid with a X phage PL promoter and synthetic DNA fragment 

15 containing an Initiation codon, followed by the sequence encoding the 
mature form of unaltered G-CSF. No data on expression levels are, 
however, disclosed. Souza et al.. Science , 232 :61-65 (1986). 

In addition, a colony stimulating factor having the same N- 
terminal 20 amino acids as that of Souza et al . was Isolated from a 

20 cultured human oral carcinoma as disclosed -in European Patent 
Publication 0169,566 to Nagata et al . This clone designated CHU-2, 
has an additional sequence encoding Val-Ser-Glu just prior to the 
C ys 36 residue of the protein encoded by the ONA sequence cloned from 
MIA PaCa-2, as described herein below. No information is given in 

25 either of the above-mentioned disclosures as to the levels of 
expression in recombinant hosts, particulaly E. coli . Nagata et al . 
subsequently reported a cDNA sequence that lacks the sequence encoding 
the above-mentioned additional tri-peptide sequence. Nagata et al., 
EMBO Journal 5:3, 575-581 (1986). 

30 Reportedly, both of these recombinantly produced G-CSFs have 

NH 2 terminal methionines as a result of expression In the recombinant 
bacterial host E. coli . Such NH 2 -terminal methionines are generally 
known to be immunogenic when the protein is administered parenterally 
to a mammalian host. 
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A number of factors are Involved 1n obtaining maximal 
expression of a protein in a recombinant host. Among these factors 
are optimization of the physiological conditions for growth of the 
host cell by changes to the culture media 1n which the host grows. 
5 Also important is the selection and use of strong promoter and 
ribosome binding sites (RBS) that are operable 1n the expression 
vector to be used. The elimination of attenuator regions if they 
occur 1n the region of regulated promoters such as the tryptophan 
operon is also desirable. 

10 A more subtle factor in maximizing protein expression in a 

recombinant host relates to codon utilization. In brief, codon 
utilization may be described as the ability of a particular 
recombinant host organism to translate some codons encoding a 
particular amino add more quickly and accurately than other different 

15 codons encoding the same amino acid. Thus, although the trinucleotide 
code specifying a particular amino add 1s degenerate, I.e., more than 
one trinucleotide sequence can encode a particular amino acid, all 
codons encoding a particular amino acid are not utilized by the host 
with equal efficiency. Enhanced expression may be obtained by 

20 changing the nucleotide sequence of a gene 1n a recombinant host to 
the host preferred codons without changing the amino acid sequence of 
the protein encoded thereby. Additionally, changes in the amino acid 
sequence of the NH 2 terminus may effect processing of the NH 2 -term1nal 
methionine mentioned above. 

25 Another subtlety in the enhancement of gene expression in an 

expression vector in a recombinant host relates to secondary structure 
formation In the mRNA sequence to be expressed. It has been shown, 
for example, that certain "stem and loop" secondary structures placed 
in proximity to the 3' terminus of a gene, lead to increased stability 

30 of the mRNA transcript and Increased expression of the protein encoded 
thereby, Wong et al., Proc. Nat. Acad. Sci. (USA), 83:10, 32-33 
(1986). However, 1t has been suggested that formation of stable 
secondary structures at the 5' end of an mRNA transcript may block 
efficient translation thereof, leading to reduced yield of the protein 

35 encoded by the gene. The influence of mRNA secondary structure on 
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translation has been discussed 1n Iserentant & Fiers, Gene , £:1-12 
(1980) and Buell et al.. Nucleic Add Research, 13_ :1923 " 1938 U985). 
DeLamarter. J. F. , et al., EMBO Journal . 4,10:2575-2581 (1985) have 
reported that the yield of GM-CSF has been Increased 1n E. coll by 
5 means of altering third position deoxyribocytosine to 
deoxyriboadenines 1n the first ten of the 5' codons of the DNA 
encoding mature GM-CSF, and attribute the Increase in protein 
synthesis to the altered 5' end of the mRNA transcript. 

It would be desirable to obtain large amounts of G-CSF for 

10 development of therapies utilizing the granulocyte stimulating and 
granulocyte-macrophage stimulating characteristics of this colony 
stimulating factor. Furthermore, it would be desirable to obtain 
forms of G-CSF that lack NH 2 -terminal methionine when expressed in the 
recombinant host, and particularly 1n E. coli. 

15 The present invention makes available G-CSF 1n large 

quantities by producing 1t at high levels in a number of hosts. 

The Invention discloses that a previously Isolated mRNA 
fraction from MIA PaCa-2 cells having high bone marrow proliferative 
activity encodes G-CSF. The invention encompasses the G-CSF mRNA. In 

20 addition, the Invention encompasses a cDNA sequence made from this 
mRNA fraction that encodes the G-CSF gene. 

The Invention also includes DNA sequences encoding mature G- 
CSF; these DNA sequences have been altered to facilitate expression of 
G-CSF in recombinant hosts. In addition, DNA sequence encoding mature 

25 G-CSF or a mutein thereof lacking NH 2 -terminal methionine when 
expressed in a recombinant host, are disclosed. 

At present, investigators have encountered great difficulty 
1n obtaining significant expression of mature G-CSF in E. coli hosts 
under P L promoter gene N-RBS control or tryptophan (Trp) promoter 

30 control when the unaltered nucleotide sequence of the mature G-CSF 1s 
used. The Inventors have determined that, when under P L gene N-RBS 
control in E^ coli, steady state levels of mRNA encoding G-CSF are 
detectable by Northern analysis as described further herein. Under 
the control of the Trp promoter, no steady state G-CSF mRNA is found 
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in E. coli by the same method. Surprisingly, In light of the work of 
Delemarter, et al . which suggests that alterations of the 5' end of 
the GM-CSF gene affects secondary structure of the GM-CSF mRNA 
sequence and the translation thereof, the inventors have found that 
5 alterations in the 5' region of the G-CSF gene leads not only to 
increased expression of G-CSF protein in both P L gene N-RBS and Trp 
controlled expression vectors in E. coli , it also leads to increased 
levels of mRNA steady state transcript in Trp controlled expression 
vectors in E. coll . 

10 Figure 1A is a plot of bone marrow proliferation activity 

obtained for Xenopus oocyte supernatants made from the fractionated 
MIA PaCa-2 mRNA. 

Figure IB is a dot blot of MIA PaCa-2 mRNA fractions with a 
CSF-l-specif ic probe. 

15 Figure 2 is a Northern blot of the RNA obtained from MIA 

PaCa-2 cells (lane 1), LD-1 cells (lane 2), and 5637 cells (lane 3), 
probed with two different 24-mer probes- In 2A the probe was 5 1 - 
ATGGCTGGACCTGCCACCCACAGC-3 1 , which hybridized to RNA of all three 
lanes. In 2B the oligomer probe was 5 ' -AGAAGCT GGTGAGTGA6T GTGCCA-3 ' . 

20 The nine nucleotides underlined appear only in the CHU-2 sequence that 
purports to be G-CSF (Nagata et al., supra ) . This probe did not 
hybridize to any of the cell lines tested under stringent conditions. 

Figure 3A is a plot of the bone marrow proliferation 
activity of the supernatants collected from Xenopus laevis oocytes 40 
25 hours after injection of the pooled MIA PaCa-2 RNA fractions* 
Activity was measured by the murine bone marrow cell proliferation 
assay described in the examples below. 

Figure 3B is a Northern blot of the pooled MIA PaCa-2 mRNA 
fractions using a G-CSF specific Y 32 P labeled ol igodeoxyrlbonucleotide 
30 probe. 

Figure 4 shows the complete sequence of G-CSF cDNA obtained 
from the sequencing of the BamH I digest of plasmid pP12 and deduced 
amino acid sequence encoded thereby. The arrow indicates the position 
of the nine nucleotide sequence GTGATGGAG of the Nagata et al . clone. 
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Figure 5 1s a schematic Illustration showing the prepartion 
of plasmld pPDl and the placement of a Hlndlll restriction site and 
ATG codon 1n the G-CSF sequence. 

Figure 6 1s a schematic Illustration of the prepartion of 
5 plasmld pPD2 in which the 3' untranslated region of the G-CSF gene Is 
excised and transcription termination and mRNA stability are placed 
under control of the positive retroregul atory element. 

Figure 7 is a schematic illustration of the preparation of 
plasmid pJDl in which G-CSF expression 1s under the control of the P L 
10 promoter and gene N-ribosome binding site. 

Figure 8 is a schematic illustration of the preparation of 
pJD4, pJD4A and pJD4B. 

Figure 9 is a schematic Illustration of the preparation of 

pPD5. 

15 Figure 10 Is a Northern blot of mRNA from pJOl and pPD2 

probed with a G-CSF specific ollgodeoxynucleotide probe. 

Figure 11 shows the DNA sequence and N-terminal amino add 
sequence encoded by native G-CSF, pJD4B, pPD5 and pJD4A. The ATG 
codon encoding Met was added to native G-CSF and does not occur 1n the 
20 cDNA sequence obtained from MIA PaCa-2 transcripts. 

Figure 12 1s an SDS gel showing expression of G-CSF in 
induced and uninduced E. coli host carrying pJD4A and pJD4B. 

Figure 13 is an SDS gel showing expression of G-CSF in 
Induced and uninduced E. coli host carrying pPD5. 

25 Figure 14 is a schematic Illustration of the preparation of 

plasmid pPD6 1n which Vj-CSF expression is under control of the P L 
promoter and gene N-ribosome binding site. 

Figure 15 1s an SDS gel showing expression of mG-CSF and Vj- 
mG-CSF. The ^-mG-CSF mobillly reflects a slightly smaller size 

30 consistant with deletion of NH 2 -terminal Met and Thr. 
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A. G-CSF Cloning and Expression 

The G-CSF of the present Invention was detected in a 
purified mRNA fraction obtained by sucrose density gradient 
centrifugation of induced MIA PaCa-2 cells as described in PCT 
5 Publication No- W086/04607 published August 14 t 1986 and assigned to 
the assignee of the present Invention. 

mRNA fractions obtained from the gradient were injected into 
Xenopus laevis oocytes; the oocytes were incubated, and the 
supernatants tested for the ability to stimulate murine bone marrow 

10 proliferation (bmp). Peak bmp activity was found in supernatants from 
oocytes that had been injected with the 14, 15 and 16S MIA PaCa-2 mRNA 
fractions although peak hybridization to a CSF-1 specific probe was 
obtained with the 18S mRNA fraction. This phenominon is illustrated 
in Figure 1A which shows a plot of bmp activity and CSF hybridization 

15 intensity per mRNA fraction. 

Further, investigation by the present inventors has 
determined that the peak bmp activity associated with the slower 
sedimenting RNA fractions 1s due to the activity of G-CSF as is shown 
in Figure 3A and 3B. These figures show that peak hybridization of a 
20 G-CSF specific probe with MIA PaCa-2 mRNA fractions sedimenting slower 
than 18S. Thus, the present invention encompasses an mRNA fraction of 
human cells that is capable of producing G-CSF in a host cell. 

"G-CSF" as used herein means a protein having the effect of 
stimulating the production of primarily granulocyte colonies or 
25 granulocyte-macrophage colonies 1n a colony forming assay using bone 
marrow cell progenitors of an appropriate species. A protein having 
this activity has the deduced amino acid sequence shown 1n Figure 4 
herein and 1s considered to be within the scope of the invention. 

G-CSF according to the invention may be isolated from the 
30 MIA PaCa-2 cell line. In addition, as shown 1n Figure 2, G-CSF 
specific RNA sequences are clearly detectable In the mRNA of LD-1 
cells and 5637 cells. Thus, G-CSF obtained by cloning from these cell 
lines as well as any other cell line which Is G-CSF probe positive, 1s 
considered within the scope of the invention. Mature G-CSF produced 
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by the LD-1 cell 11ne has been Isolated and amlno-acid sequenced. 65% 
of the LD-1 produced protein has a thr residue at the NH 2 - terminus and 
35% has a pro residue at the NH 2 -term1nus. 

As 1s the case for all proteins, the precise chemical 

5 structure of G-CSF depends on a number of factors. As lonizable amino 
and carboxyl groups are present in the molecule, a particular protein 
may be obtained as an acidic or basic salt, or 1n neutral form. All 
such prepartions which retain their activity when placed in suitable 
environmental conditions are Included in the definition. Further, the 

10 primary amino acid sequence may be augmented by derivatization using 
sugar moieties (glycosylate) or by other supplementary molecules 
such as lipids, phosphate, acetyl groups and the like, more commonly 
by conjugation with saccharides. The primary amino acid structure may 
also aggregate to form complexes. Certain aspects of such 

15 augmentation are accomplished through post-translational processing 
systems of the producing host; other such modification may be 
introduced in vitro . In any event, such modifications are included in 
the definition so long as the activity of the protein, as defined 
above. 1s not destroyed. It is expected, of course, that such 

20 modifications may quantitatively or qualitatively affect the activity, 
either by enhancing or diminishing the activity of the protein in the 
various assays. 

Further, individual amino acid residues in the chain may be 
modified by oxidation, reduction, or other derivatization, and the 

25 protein may be cleaved to obtain fragments which retain activity. 
Such alterations which do not destroy activity do not remove the 
protein sequence from the definition. 

Modifications to the primary structure itself by deletion, 
addition, or alteration of the amino acids Incorporated Into the 

30 sequence during translation can be made without destroying the 
activity of the protein. Such substitutions or other alterations 
result 1n proteins having an amino acid sequence which falls within 
the definition of proteins "having an amino acid sequence 
substantially equivalent to that of G-CSF". 
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For convenience, the mature G-CSF protein amino acid 
sequence shown in Figure 4, deduced from the cDMA clone illustrated 
herein, is designated mG-CSF (mature G-CSF) beginning at the amino 
acid residue threonine designated +1. Figure 4' shows the presence of 
5 a 30 residue putative signal sequence, which is presumably cleaved 
upon secretion from mammalian cells; mG-CSF is represented by amino 
acids 1-174 shown in that figure. Specifically included in the 
definition of human G-CSF are muteins which monomers and dimers, if 
any, are G-CSF and related forms of G-CSF, designated by their 
10 differences from mG-CSF. G-CSF derived from other species may fit the 
definition of "human" G-CSF by virtue of its display of the requisite 
pattern of activity as set forth above with regard to human substrate. 

Also for convenience, the amino acid sequence of G-CSF will 
be used as a reference and other sequences which are substantially 

15 equivalent to this in terms of G-CSF activity will be designated by 
referring to the sequence shown in Figure 4. The substitution of a 
particular amino acid will be noted by reference to the number of the 
amino acid residue which it replaced. Thus, for example, ser 60 G-CSF 
refers to the protein which has the sequence shown in Figure 4 except 

20 that the amino acid at position 60 is serine rather than proline. 
Deletions are noted by a v followed by the numer of amino acids 
deleted from the N-terminal sequence, or by the number of amino acids 
remaining when residues are deleted from the C-terminal sequence, when 
the number is followed by a minus sign. Thus, V4-G-CSF refers to G- 

25 CSF of Figure 4 wherein the first four amino acids from the N-terminus 
have been deleted; v 13 q- refers to G-CSF wherein the last 44 amino 
acids following amino acid 130 have been deleted. 

"Operably linked" or "in operable linkage" refers to 
juxtaposition such that the normal function of the components can be 
30 performed. Thus, a coding sequence "operably linked 11 to control 
sequences refers to a configuration wherein the coding sequence can be 
expressed under the control of these sequences. 

"Control sequences" refers to DNA sequences necessary for 
the expresion of an operably linked coding sequence in a particular 
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host organism. The control sequences which are suitable for 
procaryotes, for example, include a promoter, optionally an operator 
sequence, a RBS, and possibly, other as yet poorly understood, 
sequences. Eucaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

"Expression system" refers to DNA sequences containing a 
desired coding sequence and control sequences in operable linkage, so 
that hosts transformed with these sequences are capable of producing 
the encoded proteins. In order to effect transformation, the 
expression system may be included on a vector; however, the relevant 
DNA may then also be integrated into the host chromosome. 

As used herein "cell", "cell line", and "cell culture" are 
used interchangeable and all such designations include progeny. Thus 
"transformants" or "transformed cells" includes the primary subject 
cell and cultures derived therefrom without regard for the number of 
transfers. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent 
mutations. Mutant progeny which have the same functionality as 
screened for in the originally transformed cell, are included. Where 
distinct designations are intended, it will be clear from the context. 



Control Sequences and Corresponding Hosts 

Procaryotes most frequently are represented by various 
strains of E. coli. However, other microbial strains may also be 
used, such as bacilli, for example Bacillus subtil is , various species 
25 of Pseudomonas , or other bacterial strains. In such procaryotic 
systems, plasmid vectors which contain replication sites and control 
sequences derived from a species compatible with the host are used. 
For example, E^coli is typically transformed using derivatives of 
pBR322, a plasmid derived from an E. coli species by Bolivar, et al., 
Gene, 2:95 (1977). pBR322 contains genes for ampidllin and 
tetracy7line resistance, and thus provides additional markers which 
can be either retained or destroyed in constructing the desired 
vector. Commonly used procaryotic control sequences which are defined 



30 
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therein to Include promoters for transcription initiation, optionally 
with an operator, along with RBS sequences, include such commonly used 
promoters as the beta-lactamase {penicillinase) and lactose (lac) 
promoter systems (Chang, et al. f Nature (1977) 198 :1056) and the 
5 tryptophan (Trp) promoter system (Goeddel, et al . t Nucleic Adds Res. 
(1980) J3;4057) and the lambda derived P L promoter and N-gene rlbosome 
binding site (Shimatake, et al., Nature (1981) 292:128), which has 
been made useful as a portable control cassette, as set forth 1n PCT 
Publication No. W085/03522 published August 15, 1985. However, any 
10 available promoter system compatible with procaryotes can be used. 

As described in detail hereinbelow, the native G-CSF 
sequence has not proved to be effective for the production of G-CSF in 
such prokaryotlc microbial hosts at high levels. In the present 
invention, alterations have been carried out to the 5' end of the mG- 
15 CSF coding sequence. In addition to placing an ATG start codon at the 
5' end of the DNA sequence encoding mG-CSF, a number of changes have 
been made in the nucleotide sequence thereof without changing the 
amino acid sequence of the mG-CSF protein encoded thereby. 

These nucleotide changes encompass codons primarily encoding 
20 the first 10 amino acids of mG-CSF. At least one nucleotide in the 
third position of at least one of these codons has been changed from a 
deoxyrlboguanldine or deoxyrlbocytosine to a deoxyrlboadenine. These 
changes are made without altering the amino acid residue encoded by 
the codon, and furthermore, the resulting altered codon 1s one which 
25 is not necessarily host-preferred, at least when the host cell 1s E. 
col i . More preferably, the third position codon alteration 1s carried 
out on at least one of the first 10 or 11 codons of mG-CSF wherein the 
5' amino acid sequence of mG-CSF remains 

ThrProLeuGlyProAl aSerSerLeuPro or MetThrProLeuGly ProAl aSerSerLeuPro , 
30 when the additional codon encoding an N-terminal Met 1s included. 

More preferably, the above-described G to A or C to A codon 
alterations are carried out on at least one of the first 4 or 5 codons 
of mG-CSF wherein the 5 1 amino acid sequence of mG-CSF remains 
ThrProLeuGly or MetThrProLeuGly when the additional codon encoding an 
35 N-terminal methionine is Included. 
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In a more preferred embodiment, the above-mentioned changes 
to the third position nucleotide of codons encoding the 5' end of mG- 
CSF are carried out to at least three of the 10 codons of the DNA 
sequence encoding the amino acid sequence 

5 ThrProLeuGlyProAlaSerSerLeuPro. 

In an even more preferred embodiment, the above-mentioned 
changes to the third position nucleotide of codons encoding the 5' end 
of mG-CSF are carried out to at least three of the 4 codons of the DNA 
sequence encoding the amino acid sequence ThrProLeuGly. 

10 when the above-mentioned changes are made in the 5* end of 

the DMA sequence encoding mG-CSF that is in an E. coli compatible 
expression vector 1n operable linkage with the Trp promoter, both 
steady state mRNA transcripts encoding mG-CSF and a protein fraction 
having mG-CSF activity are readily detectable by Northern blotting and 

15 Xenopus laevis oocyte assay, bone marrow proliferation, or granulocyte 
colony stimulation assays, respectively. By contrast, the native mG- 
CSF DNA sequence in otherwise operable linkage with the Trp promoter 
yields neither mRNA transcript nor detectable by Northern blot nor a 
protein fraction having G-CSF activity in the above-mentioned assays. 

20 The native G-CSF DNA sequence in an expression vector in 

operable linkage with the P L promoter gene N-RBS, yields detectable 
steady state mRNA transcripts encoding G-CSF, but no protein fraction 
having mG-CSF. However, when the altered mG-CSF DNA sequence 
according to the invention 1s placed in the same vector under the same 

25 promoter RBS control, a protein fraction 1s produced having mG-CSF 
activity. 

The Invention also concerns muteins of G-CSF 1n which the N- 
terminal methionine is not present when G-CSF 1s produced from a 
recombinant host. In prokaryotic recombinant hosts such as col_j_. 
30 the translation initiation codon ATG encodes the amino acid 
methionine. As a result, the amino or NH 2 -terminus of recombinant 
eukaryotic proteins produced in these recombinant hosts has a 
methionine residue that is generally not found in the protein when 
produced in the eukaryotic cell from which the DNA was initially 
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obtained. This superfluous NH 2 -terminal methionine, however, can be 
processed and removed by certain exopeptidase enzymes, most 
particularly methionine amino peptidase or met amino peptidase- This 
enzyme although capable of removing N^-terminal amino acid residues, 
5 specifically does not remove the N^-terminal methionine with equal 
efficiency from all proteins. In particular, the efficiency of the 
enzyme varies depending upon the Identity of the amino acid residue 
adjacent to the NH 2 -terminal methionine. 

The precise specificity of the enzyme in this regard cannot 

10 be determined without testing nearly an infinite number of substrates; 
however, a useful rule of thumb is set forth by Sherman, F. , et al . , 
Bio Assays 3:27-31 (1985). Sherman et al . based their considerations 
for specificity of Met-ami no-peptidases on the observed forms of 
mutants of iso-l-cytochrome-C from yeast and the published primary 

15 sequence of 82 mature intracellular proteins. They conclude that 
methionine is usually cleaved from residues with a side chain having a 
radius of gyration of 1.29 A or less, but generally not cleaved from 
residues with side chains larger than 1.43 A. This is consistent with 
the observation that mutationally altered iso-l-cytochrome-C taken in 

20 consideration with other published sequences of other proteins from 
procaryotic and eucaryotic systems indicate that N-terminal methionine 
1s cleaved when it precedes residues of alanine, cysteine, glycine, 
proline, serine, threonine, or valine, but not when it precedes 
residues of arginine, asparagine, aspartic acid, glutamine, glutamic 

25 acid, isoleucine, leucine, lysine, or methionine. These results are 
generally consistent with those set forth in the illustrations 
below. However, some exceptions occur where the radius of gyration 
for the second amino acid is higher than 1.29 A and the secondary and 
tertiary structure or other conditions are particularly favorable. 

30 Therefore, this aspect of the specificity of the enzyme is intended as 
a general guideline and it should be borne in mind that even the 
specificity of particular amino peptidase have not been determined 
with exact precision, i.e., not all possible tertiary structures have 
been tested. Therefore, in order to fall within the definition of 

35 "Met-ami nopeptidase", the enzyme needs only to meet the requirement of 
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specific cleavage of N-terminal methionine without cleavage of 
internal methionine residues and without cleavage of N-terminal 
residues other than methionine from any peptide. 

Among the NH 2 -terminal amino acid sequences that may be 
5 processed by amino terminal methionine peptidase, are those in which 
the methionine is followed by ala, gly or pro. In addition, NH 2 - 
terminal methionine followed by ala-met, gly-met, ala-ser, gly-gly, 
ala-pro, or pro-thr is expected to be processed. 

Thus, any of the form of G-CSF or muteins of G-CSF in which 

10 the above-mentioned sequences comprise the N-terminal portion of the 
amino acid will fall within the definition of G-CSF according to the 
invention so long as the biological activity of G-CSF is maintained. 
In particular, a deletion mutant of G-CSF, in which the threonine 
residue following the NH 2 -terminal methionine is deleted to yield a 

15 Vj-G-CSF is illustrated in some detail hereinbelow. Specifically 
mutated forms of G-CSF in which the thr x residue is substituted with 
ala, or gly, ala-ser, gly-gly, or in which the order of pro and thr 
are reversed will also fall within the scope of the invention. DNA 
species encoding the above-indicated muteins of G-CSF may be made by 

20 site specific mutagenesis of the cDNA encoding native G-CSF using 
known techniques. Oligonucleotides encoding these muteins may also be 
substituted for DNA sequences encoding the native G-CSF amino terminal 
amino acid sequence through ligation of ol igodeoxyribonucleotides 
encoding the altered sequenced into specifically digested G-CSF DNA 

25 sequences. 

B. General Methods for Carrying out the Invention 
Transformations 

Depending on the host cell used, transformation is done 
using standard techniques appropriate to such cells. The calcium 
30 treatment employing calcium chloride, as described by Cohen, S. N. , 
Proc. Natl. Acad. Sci. (USA) (1972) 69:2110, or the RbCl 2 method 
described in Maniatis, et al., Molecular Cloning: A Laboratory Manual 
(1982) Cold Spring Harbor Press, p. 254 was used for procaryotes or 
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other cells which contain substantial cell wall barriers. For 
mammalian cells without such cell walls, the calcium phosphate 
precipitation method of Graham and van der Eb, Virology (1978) 52 : 546 
may be used or the method of Wang, et al., Science, 228 :149 (1985). 
5 Transformations into yeast are carried out according to the method of 
Van Solingen, P., et al., J. Bact. (1977) 130 :946 and Hsiao, C. L., et 
a!., Proc. Natl. Acad. Sci. (USA) (1979) 26:3829. 

Screening cDNA Libraries 

cDNA libraries are screened using the colony hybridization 
10 procedure. Lifts of colonies are made onto nitrocellulose filter 
papers (S & S type BA-85). The colonies are lysed and DNA fixed to 
the filter by treatment for five minutes with 0.5 M NaOH, 1.5 M NaCl, 
and are then washed twice for five minutes each time with 1.0 H Tris 
pH 8, 3 M NaCl. Filters are air dried and baked at 80°C for two 
15 hours. The duplicate filters are prehybridized at 45~50°C for one 
hour in 5 x SSC, 10 x Denhardt's solution (0.2% polyvinylpyrrolidone, 
0.2% Ficoll, 0.2% BSA ), 0.1% SDS, 50 mM sodium phosphate pH 7.0, and 
100 ug/ml tRNA. 

Hybridization of the filters is done in a solution similar 
20 to that described above for prehybri dization, but also contains 10% 
dextran sulfate with kinased probe at 1 x 10 6 CPM/ml under conditions 
which depend on the stringency desired. Typical moderately stringent 
conditions employ temperatures of 45-50°C for 16-20 hours with 1-5 
ml /filter of DNA hybridization buffer containing probe. For higher 
25 stringencies higher temperatures are employed. The filters are washed 
three times for 15 minutes each time at appropriate temperatures using 
3 x SSC, 0.1% SDS, air dried, and are autoradiographed at -70°C for 
two to three days. 

Vector Construction 

30 Construction of suitable vectors containing the desired 

coding and control sequences employs standard ligation and restriction 
techniques which are well understood in the art. Isolated pi asmids, 
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DNA sequences, or synthesized oligonucleotides are cleaved, tailored, 
and rell gated 1n the form desired. 

Site specific DNA cleavage 1s performed by treating with the 
suitable restriction enzyme (or enzymes) under conditions which are 
5 generally understood in the art, and the particulars of which are 
specified by the manufacturer of these comtnerdaly available 
restriction enzymes. See, e.g., New England Biolabs. Product 
Catalog. In general, about 1 w 9 of plasmid or DNA sequence is cleaved 
by one unit of enzyme in about 20 v l of buffer solutions. In the 
10 examples herein, typically, a 3-10 fold excess of restriction enzyme 
is used to insure complete digestion of the DNA substrate. Incubation 
times of about one hour to two hours at 37°C or other appropriate 
temperatures are workable, although variations can be tolerated. 
After each incubation, protein is removed by extraction with 
15 phenol /chloroform, and may be followed by ether extraction, and the 
nucleic acid recovered from aqueous fractions by precipitation with 
ethanol followed by running over a Sephadex G-50 spin column. If 
desired, size separation of the cleaved fragments may be performed by 
polyacyUmide gel or agarose gel electrophoresis using standard 
20 techniques. A general description of size separations 1s found in 
Methods in Enzymology (1980) 65:499-560 or Maniatls, Molecular 
Honing; A Laboratory Manual . Cold Spring Harbor Laboratory. Cold 
Spring Harbor, NY (1982). 

Restriction cleaved fragments may be blunt ended by treating 
25 with the large fragment of E. coli DNA polymerase I (Klenow) 1n the 
presence of the four deoxynucleotide triphosphates (dXTPs) using 
incubation times of about 15 to 25 minutes at 20 to 25'C 1n 50 mM Tr1s 
P H 7.6, 50 mM NaCl . 6 mM MgCl 2 . 6 mM DTT and 5-10 V M dXTPs. The 
Klenow fragment fills 1n at 5' overhangs but chews back protruding 3' 
30 single strands, even though the four dXTPs are present. If desired, 
selective repair can be performed by supplying only one of the. or 
- selected, dXTPs within the limitations dictated by the nature of the 
sticky ends. After treatment with Klenow, the mixture is extracted 
with phenol /chloroform and ethanol precipitated followed by running 
35 over a Sephadex G-50 spin column. Treatment under appropriate 
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conditions with SI nuclease results In hydrolysis of any single- 
stranded portion. 

Synthetic oligonucleotides are prepared by the triester 
method of Matteucci, et al . ( J. Am. Chem. Soc. (1981) 103 :3185) or 
5 using commercially available automated oligonucleotide synthesizers. 
Kinasing of single strands prior to annealing or for labeling is 
achieved using an excess, e.g., approximately 10 units of 
polynucleotide kinase to 0,1 nmole substrate in the presence of 50 mM 
Tris, pH 7.6, 10 mM MgCl 2 , 5 mM dithiothrei to! , 1-2 mM ATP, 1.7 pmoles 
10 *y-32P-ATP (2.9 mCi/mmole), 0.1 mM spermidine, 0.1 mM EDTA. 

Ligations are performed in 15-30 yl volumes under the 
following standard conditions and temperatures: 20 mM Tris-Cl pH 7.5, 
10 mM MgCl 2 , 10 mM DTT, 33 yg/ml BSA, 10 mM-50 mM NaCl , and either 40 
yM ATP, 0.01-0.02 (Weiss) units T4 DNA ligase at 0°C (for "sticky end" 

15 ligation) or 1 mM ATP, 0.3-0.5 (Weiss) units T4 DNA ligase at 14°C 
(for "blunt end" ligation). Intermol ecul ar "sticky end" ligations are 
usually performed at 33-100 yg/ml total DNA concentrations (5-100 nM 
total end concentration). Intermol ecul ar "blunt end" ligations 
(usually employing a 10-30 fold molar excess of linkers) are performed 

20 at 1 pM total ends concentration. 

In vector construction employing "vector fragments", the 
vector fragment is commonly treated with bacterial alkaline 
phosphatase (BAP) in order to remove the 5' phosphate and prevent 
religation of the vector. BAP digestions are conducted at pH 8 in 

25 approximately 150 mM Tris, in the presence of Na + and Mg +2 using about 
1 unit of BAP per yg of vector at 60°C for about one hour. Vector 
fragments subjected to this treatment are referred to herein as 
"bapped". In order to recover the nucleic acid fragments, the 
preparation is extracted with phenol /chl oroform and ethanol 

30 precipitated and desalted by application to a Sephadex G-50 spin 
column. Alternatively, religation can be prevented in vectors which 
have been double digested by additional restriction enzyme digestion 
of the unwanted fragments. 
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For portions of vectors derived from cDNA or genomic DNA 
which require sequence modifications, site specific primer directed 
mutagenesis 1s used. This is conducted using a synthetic 
oligonucleotide primer complementary to a single stranded phage DNA to 

5 be mutagenized except for limited mismatching, representing the 
desired mutation. Briefly, the synthetic oligonucleotide is used as a 
primer to direct synthesis of a strand complementary to the phage, and 
the resulting double-stranded DNA 1s transformed into a phage- 
supporting host bacterium. Cultures of the transformed bacteria are 

10 plated in top agar, permitting plaque formation from single cells 
which harbor the phage. 

Theoretically, 50% of the new plaques will contain the phage 
having, as a single strand, the mutated form; 50% will have the 
original sequence. The resulting plaques are hybridized with kinased 

15 synthetic primer at a temperature which permits hybridizaton of an 
exact match, but at which the mismatches with the original strand are 
sufficient to prevent hybridization. Plaques which hybridize with the 
probe are then picked, cultured, and the DNA recovered. Details of 
site specific mutation procedures are described below in specific 

20 examples. 

Verification of Construction 

In the constructions set forth below, correct ligations for 
plasmid construction are confirmed by first transforming E. col i 
strain MM294 obtained from E. col i Genetic Stock Center, CGSC 6135, or 

25 other suitable host with the ligation mixture. Successful 
transformants are selected by ampicillin, tetracycline or other 
antibiotic resistance or using other markers depending on the mode of 
plasmid construction, as is understood in the art. Plasmids from the 
transformants are then prepared according to the method of Clewell, D. 

30 B., et a!.. Prnc. Natl. Acad. Sci. (USA) (1969) 62:1159, optionally 
following chloramphenicol amplification (Clewell, D. B., J. Bacteriol. 
(1972) UO:667). The isolated DNA is analyzed by restriction enzyme 
mapping and/or sequenced by the dideoxy method of Sanger, F., et al., 
Proc. Natl. Acad. Sci. (USA) (1977) 24:5463 as further described by 
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Messing, et al . , Nucleic Acids Res, (1981) 2:309, or by the method of 
Maxam, et al., Methods in Enzymology (1980) 65:499. 

Hosts Exemplified 

Host strains used in cloning and expression herein are as 

5 follows: 

For cloning and sequencing, and for expression of 
construction under control of most bacterial promoters, E. coll strain 
MM294 (supra), Talmadge, K. , et al., Gene (1980) J^:235; Messelson, 
M., et -a l . , Nature (1968) 217 :1110, was used as the host. For 
10 expression under control of the N-RBS promoter, E. col i strain K12 
MC1000 lambda lysogen, N 7 N 53 cI857SusP 80 , ATCC 39531 (hereinafter 
sometimes referred to as MC1000-39513 XDG95 or DG95) is used. 

For M13 phage recombinants, E. coli strains susceptible to 
phage infection, such as E. coll K12 strain DG98 are employed. The 
15 DG98 strains has been deposited with ATCC July 13, 1984 and has 
accession number 1965. 

In addition to bacteria, eucaryotic microbes, such as yeast, 
may also be used as hosts. Laboratory strains of Saccharomyces 
cerevisiae , Baker 1 s yeast, are most used although a number of other 

20 strains are commonly available. While vectors employing the 2 micron 
origin of replication are illustrated, Broach, J. R., Meth. Enz. 
(1983) 101 :307, other plasmid vectors suitable for yeast expression 
are known (see, for example, Stinchcomb, et al., Nature (1979) 282 :39, 
Tschempe, et al., Gene (1980) Kl :157 and Clarke, L. , et al., Meth. 

25 Enz. (1983) 101 :300 ). Control sequences for yeast vectors include 
promoters for the synthesis of glycolytic enzymes (Hess, et al., J. 
Adv. Enzyme Reg. (1968) 7:1*9; Holland, et al., Biochemistry (1978) 
^^900 ). Additional promoters known in the art include the promoter 
for 3-phosphoglycerate kinase (Hitzman, et al . , J. Biol . Chem. (1980) 

30 255 :2073 ), and those for other glycolytic enzymes, such as 
glyceral dehyde-3-phosphate dehydrogenase, hexosekinase, pyruvate 
decarboxyl ase, phosphof ructoki nase, gl ucose-6-phosphate i somerase , 3- 
phosphoglycerate rnutase, pyruvate kinase, triosphosphate isomerase, 
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phosphoglucose isomerase, and glucokinase. Other promoters, which 
have the additional advantage of transcription controlled by growth 
conditions are the promoter regions for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with 
nitrogen metabolism, and enzymes responsible for maltose and galactose 
utilization (Holland, Ibid). Alterations to the 5* end of the mG-CSF 
DNA sequence analogous to those shown to be effective with the Trp 
promoter and P L promoter may also be carried out for use with such 
eukaryotic promoters. 

It is also believed terminator sequences are desirable at 
the 3' end of the coding sequences. Such terminators are found in the 
3' untranslated region following the coding sequences in yeast-derived 
genes. Many of the vectors illustrated contain control sequences 
derived from the enolase gene containing plasmid peno46 (Holland, M. 
15 J., et a!., J. Biol. Chem. (1981) 256:1385) or the LEU2 gene obtained 
from YEpl3 (Broach, J., et al., Gene (1978) 8; 121), however any vector 
containing a yeast compatible promoter, origin of replciation and 
other control sequences is suitable. 

It is also, of course, possible to express genes encoding 
20 polypeptides 1n eukaryotic host cell cultures derived from 
multicellular organisms as described further hereinbelow. See, 
generally for example, Tissue Culture , Academic Press, Cruz and 
Patterson, editors (1973). Useful host cell lines include murine 
myelomas N51, VERO and HeLa cells, COS-7 cells, and Chinese hamster 
25 ovary (CHO) cells. Expression vectors for such cells ordinarily 
include promoters and control sequences compatible with mammalian 
cells such as, for example, the commonly used early and late promoters 
from Simian Virus 40 (SV 40) (Fiers, et al., Nature (1978) 273:113), 
or other viral promoters such as those derived from polyoma, 
30 Adenovirus 2, bovine papiloma virus, or avian sarcoma viruses, ..or 
immunoglobin promoters and heat shock promoters. General aspects of 
mammalian cell host system transformations have been described by 
Axel, U.S. Patent No. 4,399,216 issued August 16, 1983. It now 
appears, also that "enhancer" regions are Important in optimizing 
35 expression; these are, generally, sequences found upstream of the 
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promoter region. Origins of replication may be obtained, if needed, 
from viral sources. However, integration into the chromosome is a 
common mechanism for DNA replication In eucaryotes. Plant cells are 
also now available as hosts, and control sequences compatible with 
5 plant cells such as the nopal 1ne synthase promoter and 
polyandenylation signal sequences (Deplcker,, A., et al., J. Mol . 
Appl. Gen. (1982) _1:561) are available. ~ 

The invention will be better Illustrated in relationship to 
the following examples which are Intended by the Inventors to be 
10 exemplary and non-1 1m1 ting. 

Example I 

Isolation and Expression of cDNA Encoding Human G-CSF 

A cDNA clone encoding human granulocyte stimulating factor 
was isolated from the MIA PaCa-2 cell Hne, and was expressed using a 

15 recombinant vector 1n COS-7 cells. The MIA PaCa-2 cell Hne described 
1n Yunls, A. A. et al., Experimental Hematol . 12:838-843 (1984), 1s an 
established cell line pubHcally available from the Cell Repository 
Lines (CRL) collection of the American Type Culture Collection, 12301 
Parklawn Avenue, Bethesda, MD 20895 under accession number ATCC CRL 

20 1420. The cDNA clone was sequenced and the corresponding amino acid 
sequence was deduced. 

A. Initial Detection of mRNA From MIA PaCa-2 Cells H aving High BMP 
Activity, But Low Hybridization to CSF-1 Probes 



25 



30 



The human derived pancreatic carcinoma cell line MIA PaCa-2 
was used as a source of mRNA to validate CSF-1 specific probes and for 
the formation of a cDNA library containing an Intronless form of the 
human CSF-1 coding sequence. The MIA PaCa-2 cell line produces CSF-1 
at a level approximately 10 fold below that of the murine L-929 cells. 

Negative control mRNA was prepared from MIA PaCa-2 cells 

maintained in serum- free medium, I.e., under conditions wherein they 

do not produce CSF-1. Cells producing CSF-1 were obtained by 
relnducing CSF-1 production after removal of the serum. 
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Cells were grown to confluence In roller bottles using 
Dulbecco's Modified Eagles' Medium (DMEM) containing 10% fetal calf 
u r.nd produce CSF-1 at 2000-5000 units/ml. The eel! cultures 
were washed, and relncubated 1n serum-free condition to suppress CSF-1 
5 Cation. For negative controls, no detectable CSF-1 was produced 
!£r a day or two of incubation without serum. Relnduced cells were 
obtained by addition of phorbol myr.stlc acetate (100 ng/ml ) to obtain 
production after several days of 1000-2000 units/ml. 

The MIA PaCa-2 mRNA was Isolated by lysis of the cell 1n 
.0 isotonic buffer with 0.5% NP-40 in the presence of 

vanadyl complex (Berger. S. L. » et .1., Biochemistry (19 79) J£.51«) 
f.Howed by phenol/cMoroform extraction, ethanol premutation, a d 
ol1 go dT chromatography, and an enriched mRNA preparat on obtained 
In more det.1l. cells are washed twice in PBS (phosphate buffered 
15 aline) and are resuspended in IHB (140 m Nad. 10 mH Tr.s, 1.5 mM 
M 9 C1 2 . PH 8) containing 10 rti vanadyl adenosine complex (Berger. S. 
L . et. al.. supra ). 

A non-ionic detergent of the ethylene oxide polymer type 
w _40) is added to 0.5% to lyse the cellular, but not nuclear 
2 o Lranes. Nuclei are removed by centrifugation at 1,000 x g for 10 
m1 „„tes. The post-nuclear supernatant is added to t~ vol ume of T 
,X0 m Tris. 1 mH ethylenedlaminetetraacetic acd (EOTA . pH 7.5) 
saturated phenol chlorofonn (1:1) and adjusted to 0.5% sod,um dodecyl 
ulfate (SOS) and 10 m EDTA. The supernatant is re-extracted four 
25 t mes and phase separated each time by centrifugation at 2 000 x g for 
10 minutes. The RNA is precipiated by adjusts the sample to 0 25 H 
Had. adding 2 volumes of 100% ethanol and stonng at -20 C. The RNA 
,s pelleted at 5.000 x g for 30 minutes, is washed with 70% and 100% 
ethanol. and is then dried. Polyadenyl ated (polyA + ) messenger RNA 
30 (mRNA) is obtained from the total cytopiasmic RNA by «h«t^p» « 
ol igo dT cellulose (Aviv. J., et al.. Proc. Natl. Acad. «"• <»"> 
69-1408-1412). The RNA is dissolved In ETS (10 mm Tns, 1 * EDTA, 
Ts% SDS pH 7.5) at a concentration of 2 mg/ml . This solution ,s 
heated to 65-C for five minutes, then ,u,cxly chilled to 4«C. After 
35 bringing the RNA solution to room temperature. It is adjusted to 0.4 M 
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NaCl and 1s slowly passed through an oil go dT cellulose column 
previously equilibrated with binding buffer (500 mM NaCl . 10 mM Tris, 
1 mM EDTA, pH 7.5 0.5% SOS). The flow-through Is passed over the 
column twice more. The column is then washed with 10 volumes of 
5 binding buffer. PolyA + mRNA Is eluted with allquots of ETS, extracted 
once with TE-saturated phenol chloroform and Is precipitated by the 
addition of NaCl to 0.2 M and 2 volumes of 100% ethanol . The RNA 1s 
repreci pita ted twice, 1s washed once 1n 70% and then 1n 100% ethanol 
prior to drying. 

10 Total. mRNA was subjected to 5-20% by weight sucrose gradient 

centrifugation in 10 mM Tris HC1 , pH 7.4, 1 mM EDTA, and 0.5% SOS 
using a Beckman SW40 rotor at 20°C and 27,000 rpm for 17 hours. The 
mRNA fractions were then recovered from the gradient by ethanol 
precipitation, and injected Into Xenopus laevls oocytes 1n the 

15 standard translation assay. The oocyte products of the RNA fractions 
were assayed in the bone marrow proliferation assay of Moore, R. N. et 
al., J. Immunol. (1983) 131 :2374, and of Prystowsky, M. B., et al . , 
Anu J. Pathol. (1984) 114:149 and the fractions themselves were 
assayed by dot blot hybridization to a 32-mer probe corresponding to 

20 the DNA 1n the second exon of the genomic sequence (exon II probe). 
These results are summarized in Figure 1A and B. 

The broken line in Figure 1A shows the response in the bone 
marrow proliferation assay of the supernatants from the Xenopus laevis 
oocytes; Figure IB shows the dot blot results. The most strongly 
25 hybridizing fraction, 11, corresponds to greater than 18S, while the 
most active fractions 8 and 9 correspond to 14-16S. 

The mRNA was also fractionated on a denaturing formaldehyde 
gel, transferred to nitrocellulose, and probed with exon II probe. 
Several distinct species ranging in size from 1.5 kb to 4.5 kb were 
30 found, even under stringent hybridization conditions. 

B. Isolation G-CSF mRNA From MIA PaCa-2 Cells 

Confluent MIA PaCa-2 cells were stimulated in serum free 
Dulbecco's minimum essential medium (DMEM) for 4 days with phorbol 
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myrlstate acetate (50 ng/ml ) and retlnolc add (10 pM). The RNA was 
prepared as described by Chirguin et al.; briefly, the cells were 
lysed in 5 M guanidine Isothiocyanate followed by centrif ligation 
through a 5.7 M cesium chloride (CsCl) cushion; poly A + RNA was 
5 prepared from the lysed cells by one selection cycle on oligo (dT) 
cellulose as described in Maniatis et al., supra. 

In more detail, cells were lysed In a solution which 
contained 5 M guanidine isothiocyanate, 0.025 M Na-c1trate. pH 7, 0.5% 
sarcoslne and 8% B-mercaptoethanol . Molecular biology grade CsCl was 

10 made up to 5.7 M or 40S w/v and buffered with 0.02 M Tris pH 7.5 and 
0.002 M Na-EDTA. All solutions were prepared under RNase free 
conditions and passed through 0.45 y Millipore filters before use. 
The lysed cells were then layered onto SW28 ul tracentrifuge tubes 
which contained layers of 10 ml 5.7 M CsCl and 6 ml 40% CsCl. 

15 After centrifugation at 26,000 rpm for 18 hours, the RNA 

pellet was dissolved in dH 2 0 and ethanol precipitated twice. 
Polyadenylated (PolyA + ) messenger RNA (mRNA) was obtained by 
chromatography of the total RNA on oligo (dT) cellulose as described 
in Maniatis, supra (at page 197). 

20 The RNA is dissolved in sterile H 2 0 and heated to 65°C for 

five minutes. An equal volume of a solution containing 0.040 M Tris 
CI pH 7.6 1.0 M NaCl, 0.002 M EDTA and 0.2% SDS is quickly added and 
the sample 1s cooled to room temperature. The sample 1s then loaded 
on an oligo (dT) column that has been equilibrated with a buffer 

25 containing 0.020 M Tris pH 7.6. 0.5 M NaCl, 0.001 M EDTA and 0.1% 
SDS. The flow through is collected, heated to 65°C, cooled to room 
temperature and passed over the column once more. The column is then 
washed with 10 volumes of wash buffer (0.02 Tris, 0.1 M NaCl. 0.001 M 
EDTA. 0.1% SDS). Poly A + mRNA 1s eluted with aliquots of 0.01 M Tris 

30 pH 7.5, 0.001 M EDTA and ethanol -precipitated twice. 

334 wg of MIA PaCa-2 Poly A + mRNA was fractionated to 5-25% 
by weight sucrose gradient centrifugation in 20 mM Tris HC1 , pH 7.5, 1 
mM EDTA, and 0.5% sarcosine using a Becknian SW40 rotor at 20°C and 
27,800 rpm for 16 hours. The mRNA fractions were collected in 400 ul 
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fractions and ethanol precipitated twice. Fractions were pooled and 
resuspended 1n 15 yl H 2 0. 

Northern blots were prepared by the electrophoresis of 5 v g 
of poly A + RNA per lane or 1 yl of each pooled RNA fraction through 1% 
5 agarose gels containing 0.5 M formaldehyde followed by blotting onto 
nitrocellulose filters. After baking the filters for 1.5 hours at 
80°C, they were prehybridized (5 x SSC, 10 x Denhardt's solution (0.2* 
polyvinylpyrrolidone, 0.2% Flcoll, 0.2% BSA), 0.1% SDS, 50 mM sodium 
phosphate pH 7.0, and 100 yg/ml tRNA) for one hour at 55°C. The blots 

10 were hybridized for 16 hours at 55°C 1n a similar solution that also 
contained 10* dextran sulfate and 10 6 cpm per ml of an oligonucleotide 
probe labeled with Y 32 P-ATP and polynucleotide kinase. The blot shown 
1n Figure 3B was washed at 55°C in 3 x SSC, 0.1* SDS. This 
oligonucleotide probe had the sequence 5 * -GTAGGTGGCACACAGCTTCTCCTG-3 ' 

15 and was designed based on the sequence of the CHU-2 cDNA clone 
described by Nagata et al., Nature, 319: 415-418 (1986). 

The RNA pools that were described in the above assay were 
translated 1n a Xenopus laevis oocytes assay by the Injection of 50 nl 
of of RNA Into each oocyte as described in Gurdon et al., Nature, 
20 233 ; 177-180 (1971). Supernatants of 10 y l per oocyte were collected 
after 40 hours and assayed for CSF activity. 

The Xenopus laevis supernatants from each hybridization 
fraction were assayed for CSF activity in a murine bone marrow cell 
proliferation assay as described in Moore, et al., J. Immunol. . 

25 in :2374-2378 (1983). Briefly, in this assay, 5 x 10 4 murine bone 
marrow cells per/well were incubated in 95-well plates (12 x 8) with 
serially diluted Xenopus laevis oocyte supernatants made from 
positively hybridizing mRNA fractions. After three days, 3 H thymidine 
(0.5 uCi/well) was added, and after six hours the cells were harvested 

30 and counted In a liquid scintillation counter. 

Peak bone marrow proliferation was found 1n Xenopus laevis 
oocyte supernatants made with mRNA fractions that were most strongly 
positive 1n Northern blots with the above-described oligonucleotide 
probe as shown 1n Figure 3. 
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C. identification of G-CSF Clones In a MI A PaCa-2 cDNA Library 

A cDNA library was prepared from the enriched MIA PaCa-2 
mR NA as described 1n Kawasaki, et al.. Science, 30:291-296 (1985). 
Briefly, the method used oil go (dT) priming of the poly A + tails and 
AMV reverse transcriptase employing the method of Okayama, H. et al., 
Mol. Cell Biol. , .3:280-289 (1983) This method results in a higher 
proportion of full length clones than does poly (dG) tailing and 
effectively uses as host vector portions of two vectors therein 
described, and readily obtainable from the authors, pcDVl and pLl. 
The resulting vectors contain the insert between vector fragments 
containing proximal BamHI and Xhol restriction sites; the vector 
contains the P BR322 origin of replication, and ampicillin resistance 
gene and SV40 control elements which result in the ability of the 
vector to effect expression of the inserted sequences in COS-7 cells. 

A 1.2 x 10 6 clone library 1n E. coll obtained from the above 
enriched MIA PaCa-2 mRNA by the Okayama and Berg method was then 
probed using the same oligonucleotide probe that yielded a positive 
signal on the most active pooled MIA PaCa-2 mRNA fractions. To probe 
the library, E^coli containing the Okayama-Berg vectors were grown up 
on nutrient medium. Colonies were lifted onto nitrocellulose filter 
papers and were lysed. DNA was fixed to the filter by treatment for 
five minutes with 0.5 mM NaOH, 1.5 M NaCl. Filters were then washed 
twice for five minutes each time with 1.5 M Tris pH 8, 3 M NaCl and 
were air dried and baked at 80°C for two hours. 

The filters for the screening were prehybridized and 
hybridized to the y 32 P labeled probe as described above 1n Section B 
of the example, but both prehybridization and hybridization were 
carried out at 50°C. Plasmids P P12 and pP28 were determined to be 
probe positive and were further characterized. 

30 D. Sequencing of 6-CSF MIA Pa r.a-2 Plasmids cDNA 

pP12 was digested with BamHI and subcloned into a M13mpl9 
vector and sequenced using the dldeoxy chain-termination method. 
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The DNA sequence and predicted protein sequence of pP12 are 
shown in Figure 4. The cDNA insert in pP12 is 1510 base pairs long 
excluding the poly (dG) and poly (dA) tails; it contains 11 more bases 
of 5' untranslated sequence than the CHU-2 G-CSF clone. The major 
5 difference between this clone derived from MIA PaCa-2 and CHU-2 cDNA 
clones of Nagata et al. is a 9 base pair Insertion (GTGATGGAG) in the 
CHU-2 clone that would encode the amino acid residues Val-Ser-Glu just 
prior to cys-36 in the MIA PaCa-2 G-CSF as indicated by the arrow in 
Figure 4. There are two other differences; an A at position 588 In 
10 the MIA PaCa-2 clone (G in the CHU-2 clone) is a silent third base 
change, and a T at position 1237 1n the MIA PaCa-2 clone (C 1n the 
CHU-2 clone) 1s 1n the 3' untranslated region. 

£• Activity of the pP12 and pP28 Proteins Produced in COS -7 Cells 

Plasmids pP12 and pP28 were purified using a CsCl gradient 
15 and COS-7 cells were transfected using a modification (Wang, A. M. , et 
al.. Science, 228 :149 (1985)) of the calcium phosphate coprecipi tation 
technique. After Incubation for three days, supernatants were 
collected and G-CSF production was assayed in a colony forming assay 
and the murine bone marrow proliferation assay, as described above. 

20 CSF activity was measured by both murine bone marrow cell 

proliferation and colony formation assays. For the colony assay, 1 x 
10 5 mononuclear bone marrow cells from Balb/c mice were used in a 
total of 1 ml, containing 5% cell supernatant, 152 fetal calf serum, 
and 0.3% agar In RPMI 1640 medium. Colonies of greater than 40 cells 

25 were counted after seven days, and cell type determined by 
cytocentrlfuglng Individual colonies and staining with a modified 
Wright stain. Each unit of CSF stimulates the formation of one 
colony. For a control, COS-7 cells were mock transfected under the 
same conditions by carrying the cells through the transfection steps 

30 without exposure to a plasmld. The results of the assay are described 
below 1n Table I. 
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TABLE I 

Production of CSF Activity bv Transfected COS-7 Cells 







Colony Assay 




Transfection 


Proliferation (cpm) 


Units /ml 


%G a 


%GM b 


pP12 


4595 


>280 


58 


42 


pP28 


6594 


>280 


83 


17 


Mock 


633 


0 


0 


0 



a = Colonies containing only granulocytes. 

b = Colonies containing both granulocytes and macrophages. 

10 EXAMPLE II 

nonina G-CSF for Expression in E. coli 

A . Preparation of G-CSF Sequence: Insert ion of 5' Hindlll 
and ATG Codon 

In order to facilitate manipulation of the G-CSF DNA 

15 sequence in various cloning and expression vectors, the G-CSF sequence 
was altered by inserting a Hindlll site and ATG codon in mG-CSF 3 of 
the GCC codon encoding alanine (-1). 

Plasmid pP12 was digested with BamHI endonucTease and the 
digest was electrophoresed in a Tris-acetate. 0.5% low-melting point 

20 agarose gel. The small fragment band carrying the G-CSF DNA sequence 
was melted and ligated into BamHI -digested M13mpl9 using T4 DNA ligase 
under sticky end conditions. After ligation, the mixture was used to 
transform coli strain DG98. ATCC accession number 39,768. The 
transformed cells were plated In the presence of 0.3 mM 

25 isopropylthlogalactoside (IPTG) obtained from Sigma Chemicals (St. 
Louis, MO) and 0.3 mM X-gal on a lawn of DG98 and grown at 37°C. Non- 
alpha complementary white plaques were grown in liquid broth and a 
sample of the culture was used to purify repllcative form (RF) DNA. 
The presence of the insert was confirmed by BamHI digestion and sizing 

30 the fragment on a 0.7% agarose gal. Phage having the expected insert 
were designated pA3. 
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A Jlindlll site and ATG Initiation codon were placed in the 
G-CSF sequence adjacent to and 1n frame with the codon encoding the N- 
terminal threonine of the mG-CSF sequence by means of site-specific 
mutagenesis using a chemically synthesized purified 33-mer 
oligodeoxy ribonucleotide having the following sequence: 

5'-GTG CAG GAA GCC AAG CTT ATG ACC CCC CTG GGC-3'. 

The Hindu I site immediately upstream of the ATG codon 
permits the convenient cloning of the mG-CSF-encoding DNA sequence 1n 
subsequent manipulation. Approximately 10 pmoles (pM) of the 33-mer 
was hybridized to about 1 pM of single stranded DNA from pA3 In about 
13 yl of 10 mM Tris pH 7.4, 90 mM NaCl , 10 mM MgCl 2 by heatirig to 85°C 
for five minutes, followed by 45°C for 20 minutes. The annealed 
mixture was chilled on 1ce and adjusted to 18 v l by the addition of 
dlthiothrletol to 10 mM, each dXTP to 0.5 mM and 5 units of DNA 
Polymerase I Klenow fragment. The reaction mixture was Incubated on 
1ce for 20 minutes followed by Incubation at room temperature for one 
hour. The repair reaction mixture was then used to transform E. coli 
strain DG98 as above, plated onto agar plates and Incubated overnight 
to obtain phage plaques. 

Plaques were blotted using nitrocellulose filter circles and 
the filters were treated as described to lyse the cells, denature the 
DNA, neutralize, rinse, fix the DNA to the filter and incubate In 
prehybridization buffer. The 33-mer oligonucleotide was end labeled 
with Y 32 P using polynucleotide kinase and hybridized to the filters at 
45°C overnight. Filters were washed and subjected to 

autoradiography. Probe positive plaques were- grown in liquid culture 
and characterized by BamH I Hindlll digestion and agarose gel sizing to 
confirm the presence of the expected fragments. One clone having the 
desired Insert was designated pApooL. pApooL was digested with BamH I 
and Hindlll. The approximately 1500 base pair BamHI-Hindlll fragment 
was Isolated, purified, and retained for ligation into a plasmid 
having the desired promoter and terminator sequences. 
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B. Placement of G-CSF Coding Sequence Under Trp Control 

To place the G-CSF coding sequence under Trp control and 
under the Influence of the BT positive retroregul atory element (PRE), 
the G-CSF coding sequence was cloned into plasmid pAM703. pAW703 1s a 
5 plasmid having the Trp promoter and PRE in operable linkage with a 
Hindlll-BamHI fragment which contains the gene encoding human tumor 
necrosis factor (TNF). 

As described below in detail, pAW703 1s digested with BamHI 
and Hindi I I and the BamH I and Hindi I I fragment of pApool encoding G- 
.0 CSF is ligated in place of the TNF fragment. 

The following describes the construction of pAW703 and the 
cloning of G-CSF into this plasmid to make pPDl. An alternate 
construction using pTrp3 is also described. 
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B.l. Construction of pAW703 

Plasmid pAW703 was made from plasmids pAW701 and pAW711 as 
follows. Plasmid pAW711 (ATCC 39,918) which contains a 400 bp BamHI- 
Sall fragment encoding the PRE was digested with Sail and BamHI 
endonucleases under conditions suggested by the supplier to yield an 
approximately 400 bp fragment containing the PRE. The digest was 
loaded on a 1% agarose preparatory gel and the small Sail -BamHI 
fragment containing the PRE was isolated, electroeluted and retained. 

Plasmid pAW701 was used to obtain the Trp control sequences 
for regulating expression of the gene. 

pAW701, a derivative of plasmid pTrp3, was made by digesting 
P Trp3 with BamH I and Hindi 1 1 endonucleases and ligation of the large 
Hindlll-BamHI fragment with Hindlll-BamHI fragment carrying the TNF 
gene. In the construction of pAW703, pAW701 was digested with BamHI 
and Sail endonucleases. The large BamHI -Sail fragment of pAW701 
containing the Trp promoter Hindi I I site and TNF sequence was isolated 
by electroelution from a preparative agarose gel, and was ligated to 
the small BamHI-Sall fragment containing the PRE, under sticky end 
conditions using T4 DNA ligase to produce pAW703. 
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Both P AW703 and pApool were digested with BaraHI and Hindi I I 
under conditions suggested by the manufacturer. The large Hindi II- 
BajnHI fragment from pApooL was gel Isolated and 11 gated with the 
pAW703 digest using T4 DNA Hgase. 

5 The ligation mixture was used to transform competent E. coli 

strain MM294. Transformants were screened for amplcillin 

resistance. Plasmids from a number of amplcillin resistant colonies 
were isolated, digested with Hindi 1 1 and BamHI and run on a 1.5% 
agarose gel with DNA sizing standards. One clone having the expected 
10 size fragments was designated pPDl. 

B «2. Trp Expression Vectors 

pTr P 3 is a host expression vector containing a Trp promoter 
and RBS immediately 5' of a Hindlll restriction site, thus permitting 
Insertion of a coding sequence having a Hindi 1 1 site Immediately 5' of 
a start codon under control of the Trp promoter. The backbone vector 
for P Trp3 is pB R322. P Trp3 was deposited with the American Type 
Culture Collection, 12301 Parklawn Drive, Rockvllle, MD (ATCC) on 
December 18, 1984, and has accession number 39,946. 

B.2.A. Preparation of pTrp3 

To construct the host vector containing the Trp control 
sequences behind a Hindi 1 1 site, the Trp promoter/operator/ribosome 
binding site sequence, lacking the attenuator region, was obtained 
from pVH153, obtained from C. Yanofsky, Stanford University. Trp 
sequences are available In a variety of such plasmids known in the 
art. P VH153 was treated with Hhal (which cuts leaving an exposed 3' 
sticky end just 5' of the Trp promoter) blunt ended with Klenow and 
partially digested with TagK The 99 bp fragment corresponding to 
restriction at the Tag_I site, 6 nucleotides preceding the ATG start 
codon of Trp leader were isolated, and then Hgated to EcoRI 
30 (repair )/ClaI digested, P BR322 to provide P Trp3. 

An improved form of P Trp3 also Includes a positive 
retroregulatory sequence in a position 1n the vector downstream of the 
coding sequence to be expressed as described hereinbelow. 
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An expression vector for G-CSF having the same control 
sequences as P AW703 may be made directly from pTrp3 (ATTC 39.940) and 
pAM711 (ATCC 39,918) and pP12 as follows. 

pTr P 3 is digested with Hindi I I and Sail. The large Hindlll- 
5 sail fragment is isolated by electrocution from a preparative gel. 
^ small BamHI-Sall fragment containing the PRE is iso at d 
PAW711 as ITscribed above and ligated to the large Hindi -Sail 
fragment of P Tr P 3 under sticky end conditions with T4 DNA ligase. 
may also be made directly by combining the B^-Hlndlll fragment 
l0 of PP12 with the above-mentioned fragments from P Tr P 3 and P AW711 and 
ligating under sticky end conditions with T4 DNA ligase. 

r p 1 acing - BamHI Restriction S ite Downstream of the G-CSF 
Translation Terminat ion Site 

The DNA sequence encoding G-CSF in P A3 contained 
15 approximately 930 nucleotides in the transcribed but untranslated 
region 3' of the translation stop codon TGA in the G-CSF sequence. In 
order to remove these untranslated codons and to place transection 
termination and mRNA stability under the influence of the positive 
retroregulatory Cement, a BamHI site was placed by site-specific 
retroregu a y translation termination 

20 mutagenesis immediately 3 or tne 

si gnal 

A chemically synthesized purified 27-mer deoxyribonucleotlde 
was cheuricaily synthesized using a co-ercial DNA synthesizer and had 
the following sequence: 
25 CCC AGC CCT GAG GAT CCA AGC CCT CCC 

Approximately 10 pM of the 27-mer was hybridized to about 1 
^ of single-stranded DNA from PpA3 In about 13 ul of 10 mM Tris pH 
7 4 90 m Nad. 10 rm HgCl 2 by heating to 86-C for fwe minutes, 
f ol owed by 45-C for 20 minutes. The annealed mixture was chilled on 
30 and adjusted to 18 by the addition of dithiothreitol t. 10 mM 

each dXTP to 0.5 mM and 5 units of DNA Polymerase I Klenow fragment. 
The reaction fixture was Incubated on Ice for 20 minutes, followed by 
incubation at room temperature for about one hour. 
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The reaction mixture was then used to transform coll 
strain DG98, plated onto agar plates and Incubated overnight to obtain 
phage plaques. Plaques were blotted using nitrocellulose filter 
circles arid the filters were treated as described to lyse the cells, 
5 denature the DNA, neutralize, rinse, fix the DNA to the filter and 
Incubate 1n prehybrldization buffer. The 27-mer oligonucleotide was 
end labeled with y phosphate using polynucleotide kinase and 
hybridized to the filters at 45°C overnight. Filters were washed and 
subjected to autoradiography. One of the probe positive plaques was 
10 designated pBpool. 

pBpool was grown in liquid culture and digested with Apal 
and BamH I endonucleases under conditions specified by the 
manufacturer. The coding region of G-CSF has an internal Apal site. 
This digestion generated an Apal -BamH I fragment of approximately 540 
15 base pairs, with the newly inverted BamH I end Immediately 3' of the 
translation stop codon of the mG-CSF gene. 

Plasmid pPDl was also digested with Apal and BamH I under 
conditions suggested by the supplier. This digestion yielded an Apal- 
BamHI fragment in which the Apal site was the same as that described 
above and the BamH I site is 3* of the untranslated region of the G-CSF 
gene. 



20 



The two digests were mixed at a molar excess of digested 
pBpool to pPDl and ligated under sticky end conditions using T4 DNA 
ligase. The ligation mixture was transformed into E. coli strain 
25 MM294. Transformants were screened for ampfcHHn resistance. 
Several colonies were Isolated and digested with Hindi I I and BamHI. A 
colony having the correct length Hindi I I -BamHI insert was designated 
pPD2. 



D. Placing mG-CSF Under P, Control 

Plasmid pLAPHe is a plasmid having the P L promoter and gene 
K-RBS controlling the expression of the Aphs gene. The Aphs gene Is 
contained In a Hindlll-BamHI fragment. The plasmid further contains 
the positive retroregulatory element 3' to the Aphe gene with a BamHI 
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site directly 5' of the PRE. Plasmid pLAphp was made from plasmld 
PFC54.T (ATCC number 39,739). Plasmid pLAPHB 1s exactly the same as 
pFC54 T except that the latter plasmid has a Hlndlll-BamHI fragment 
encoding des-alanyl -IL-2 in place of the Aphe encoding fragment. 
Thus the following steps to place the expression of G-CSF under 
temperature sensitive P L promoter-gene N-RBS and PRE control can also 
be carried out using P FC54.T. The Hindlll-BamHI fragment of P PD2 
having the ATG initiation codon was placed under P L promoter control 
and PRE influence as follows: 

Plasmid pLAPHB was digested with HindHI and BamHI under 
conditions suggested by the supplier. Plasmid P PD2 was digested with 
the same endonucl eases. The digests were combined and ligated with T4 
DNA ligase with the P PD2:pLAPHB digests at a 10:1 molar ratio. 

After ligation, the ligation mixture was used to transform 
competent E^ coVi. host strain XNDG95. Ampicillin resistant colonies 
were selected and plasmids were isolated therefrom, digested with 
BamHI and Hindi I I and a transformant having the expected insert of 
approximately 540 bp was designated pJDl. 

EXAMPLE III 
20 Ex pression of mG-CSF in pPD2 and pJDl 

A. Er nwth and Induction of pPD2 In Tra nsformed £. coli Strains 
T 3M294 and pJDl in Iransto rmed E. coh in XN Ub*t> , 

Initial attempts to produce mG-CSF using the two different 
expression vectors were not successful. The first vector, pJOl. 
(Figure 5) consisted of the DNA encoding the mature mG-CSF protein 
under the control of the bacteriophage x P L promoter and the X gene N- 
RBS The Bacillus thuringiensls positive retroregulating element 
(PRE) is immediately downstream of the mG-CSF coding region. In 
addition, the origin of replication, derived from ColEl. contained 
mutations that conferred a temperature-sensitive Cop" phenotype. E^ 
coll cells, strain 0G95x (x N 7 N 33 C1837 sus P80). harboring pJDl were 
"^n at 30'C to an optical density of 0.3 at 600 nm. The P L promoter 
and plasmid replication were then induced by shifting the temperature 
to 42°C for three hours. 
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The second vector pPD2, consisted of DNA encoding the mG-CSF 
protein under the control of the E. coll Trp promoter. The Bacillus 
thurlngiensis positive retroregulatlng element was Immediately 
downstream of the mG-CSF coding region. E. coll cells, strain M1294, 
5 were grown overnight In M9 medium plus 100 ug/ml of Trp and 50 „g/ml 
of ampldllln. In the morning the cells were washed twice with M9 
medium and the Trp promoter was then induced by setting up cultures at 
an optical density of 0.05 at 600 nm In M9 medium without Trp. The 
cells were grown at 37°C to an optical density of 0.8 at 600 nm. 

10 After induction of pJDl and pPD2, extracts of Induced and 

uninduced cells were prepared by boiling the cells for five minutes 1n 
Laemmll loading buffer (LaemmH, Nature. 227 :680-685 (1970)) which 
contains sodium dodecyl sulfate (SDS) and 2-mercaptoethanol (2ME). 
After electrophoresis through 12.5% polyacryl amide gels (PAGE), 

15 containing SDS and 2ME, and staining with Coomassle brilliant blue, an 
Induced protein band of the molecular weight corresponding to mG-CSF 
could not be detected. 

B. Steady State mG-CSF mRNA Level After Induct ion of Cells 
Harboring pJDl and pPD2 ~~ 



20 



To determine the steady level of G-CSF transcripts In cells 
harboring pJDl and pPD2, RNA was prepared as follows: 20 ml of 
Induced or uninduced cells was mixed with 20 ml of Ice cold phosphate 
buffered saline. The cells were pelleted by centrifugatlon at 4°C and 
resuspended 1n 3 ml of 0.15 M sucrose-20 mM sodium acetate. 0.3 ml of 
25 a 20 mg/ml lysozyme solution was added and the mixture was incubated 
on Ice. 0.15 ml of 20% SDS was added and the mixture was extracted 
with equal volumes of phenol :chloroform (1:1) until the Interface was 
clear (about 4 times). After two ethanol precipitations, the nucleic 
acid mixture was resuspended 1n 100 y l of 20 mM Tris (pH 7.5)-10 mM 
30 MgCl 2 . Two microliters of a 1 mg/ml solution of RNase-free DNase I 
was added and the mixture was Incubated at 37°C for 10 minutes. Five 
yl of 20% SDS was added and the samples were then, phenol rchloroform 
(1:1) extracted twice and ethanol precipitated 2 times. 
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The Northern blotting analysis was performed as described 
using a modification of a published protocol (Man1at1s et al., 
Molecular ClonlngrA Laboratory Manual , Cold Spring Harbor, N.Y. 
(1982)). Briefly, Northern blots were prepared by the electrophoresis 
5 of RNA through 1% agarose gels containing 0.5 M formaldehyde followed 
by blotting onto nitrocellulose filters. After baking the filters for 
1.5 hours at 80°C, they were prehybridized for one hour at 45°C in: 5 
x SSC, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2% bovine serum 
albumin, 0.1% SDS, 50 mM sodium phosphate pH 7.0, and 100 pg/ml 

10 tRNA. The blots were hybridized for 16 hours at 45°C in a similar 
solution that also contained dextran sulfate (10%) and 10 6 cpm per ml 
of the mG-CSF specific oligonucleotide 5 1 -CGCTGCGCCATCGCCCTGGATCTT-3 1 
that had been labeled with y 32 P ATP and polynucleotide kinase. The 
blots were washed 1n 3.2 M tetramethyl ammonium chloride for 15 minutes 

15 at 62°C. The bands corresponding to mG-CSF tnRNA were then detected by 
autoradiography. 

Figure 10 shows that Induced cells harboring pJDl contained 
a mG-CSF mRNA of the expected size as well as smaller mG-CSF related 
mRNA while cells harboring pPD2 did not contain detectable mG-CSF 
20 mRNA. 

EXAMPLE IV 

A. Construction of a P L Vector with Alternative mG-CSF Codons 

Two oligodeoxyribonucleotldes were chemically synthesized 
and purified and had the following sequences: A * 5'- 
25 AGCTTATGACACCATTAGGAC-3 1 , B « 5 1 -TAATGGTGTCATA-3 1 . 

DNA of pJDl was digested with the restriction enzymes 
Hind UI and Apal and, after electrophoresis, the large fragment was 
Isolated from a Tris-acetate, 0.5% low-mel ting-point agarose gel 
(Crease et al., Methods 1n Enzymology , 101 :78 (1983)). This fragment 
30 was mixed with 20 pmole of each of the above two oligonucleotides. 
This mixture was ligated with T4 DNA Ugase, and used to transform 
competent E. coli cells of the strain DG95. The cells were plated on 
agar plates containing ampicillin and colonies containing recombinant 
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plasmlds (pJD4A and pJD4B) were Identified by colony hybridization 
(Manlatis et al. supra) using oligonucleotide A which had been labeled 
with Y *P ATP and polynucleotide kinase. As shown 1n Figure 11, P JD4A 
and P JD4B, both encode the same protein as pJDl; however, codons 2 3 
and 4 of pJD4A and P JD4B are different from the ones used in pJDl. In 
addition, codon 5 In pJD4B also differs from pJDl. This difference 
between pJ04A and P JD4B results from the fact that the end of the 
oligomer duplex that was ligated to the Apal site contained a single 
mismatch to the single-stranded end generated by the enzyme Apal. 
This type of mismatch will not interfere with ligations (Hung and 
Wens1nk ' Nucleic Acids Research. 21:1863-1874 (1984)); however 
depending on the repair of this mismatch, some of the resulting clones 
will regenerate the Apal site and have a pJDl-Hke 5th codon and some 
will lose the Apal site and have a pJD4B-like 5th codon. The pJD4A 
and pJD4B constructions were verified by DNA sequence analysis. 

B - Expression of mS-CSF Pro tein by D JD4A and pJD4B 

1^ coH harboring either P JD4A or P JD4B were Induced and 
analyzed by SDS-PAGE as described for pJDl in Example III. Figure 12 
shows that an Inducible protein, with a molecular weight corresponding 
to that of mG-CSF. is present in coH cells that harbor either 
pJD4A or pJD4B. 



25 



30 



C. 



Two oligonucleotides were chemically 
purified and had the following sequences: 




synthesized and 



C : 5 • AGCTTATGCCAHAGGAC3 ' 
D: 3' ATACGGTAAT 5' 



DNA of plasmld pJDl was digested with restriction enzymes 
HindHI and Apal an d the large fragment was Isolated as described 1n 
IV.A. above. 20 P M each of oligonucleotides C and D were mixed with 
the large fragment, ligated with T 4 DNA llgase under sticky end 
conditions, and the ligated mixture was used to transform competent E 
coH strain DG95. Cells were plated on agar containing ampicmm a^ 
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Amp resistant colonies containing a recombinant plasmid were 
identified by hybridization with oligonucleotide C, which had been 
labeled with Y 32 P ATP using polynucleotide kinase. The plasmid 
obtained from the probe positive colonies was designated pJD6 and 
encoded a G-CSF species in which the thrj 1s deleted, i.e., v r «6- 
CSF. The correct sequence was confirmed by dideoxy sequencing. 

D . Ex pression of mG-CSF Protein by pPD6 

E. coli DG95 harboring pPD6 was induced and analysed as 
described for pJDl in Example III. 

Figure 15 is a picture of an SDS-PAGE gel of proteins 
produced by E^ coli_ harboring pPD6 and pJD5. The cell clearly shows 
that P PD6 produce an inducable protein with a slightly smaller 
molecular weight than MG-CSF produced by pJD5. 

The protein band produced by pPD6 was isolated and purified 
from a large scale induction of pPD6. The purified protein was 
sequenced using an automated amino acid sequencer and showed that the 
NH 2 -terminus of this protein 1s pro rather than met or thr. 

EXAMPLE V 

A instruction of a Tro Expres sion Vector Containing Alternative 
iG-CSF CodonT 

DNA from P JD4A was digested with the restriction enzymes 
HindHI and BamHI. The small fragment, containing the coding region 
7o7 mG-CSF, was isolated from Tris-acetate 0.5% low-melting-point 
agarose gel. DNA from P PD2 was digested with the restriction enzymes 
25 Hindi 1 1 and BamHI and the large fragment was isolated from a Tris- 
Icltate 0.5% low-melting-point agarose gel. These two fragments were 
ligated together with T4 DNA ligase and used to transform competent^ 
coli of the strain MM294. An ampicillin resistant colony containing 
"^recombinant was picked and the construct was verified by DNA 
30 sequence analysis. The resulting plasmid was designated pPD5. 
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B. Expression of mG-CSF in pPD5 

L. coll harboring pPD5 was Induced and analyzed by SDS-PAGE 
as described for pP02 in the previous example. Figure 13 shows that 
an Inducible protein, with a molecular weight corresponding to that of 
mG-CSF, was present 1n E. coll cells that harbor pPD5. 

C. Using Alternative mG-CSF Codons Increases the Steady State Level 
of mG-CSF mRNA ' ~~ 

The steady state level of mG-CSF mRNA 1n induced cells 
harboring either pPD2 or pPD5 was compared by Northern analysis as 
described above. Figure 14 shows that Induced cells harboring pPD5 
contain a mG-CSF transcript of the expected size, while no mG-CSF 
transcripts were detected 1n induced cells harboring pPD2. 

The following plasmids have been deposited 1n the American 
Type Culture Collection, Rockville, MD USA: 



Plasmid Strain Date Accession # 

PJD4A E. coll K12 DG95A 8/12/86 67,181 

pJD4B T7 coTT K12 DG95A 8/12/86 67,183 

pPD5A E. coji K12 MM294 8/12/86 67,182 

pAWll T7 coTT K12 DG95X 11/8/84 39,918 

pTrp3 "ET coli MC1000 12/18/84 39,946 

pFC54.t T7 ZoTT DG95x 8/7/84 39,789 



These deposits were made under the provisions of the 
Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for the Purposes of Patent Procedure and the 
Regulations thereunder (Budapest Treaty). This assures maintenance of 
a viable culture for 30 years from date of deposit. The deposits will 
be made available by ATCC under the terms of the Budapest Treaty, and 
subject to an agreement between applicants and ATCC which assures 
permanent and unrestricted availability upon issuance of the pertinent 
U.S. patent. The Assignee herein agrees that If the culture on 
deposit die or be lost or destroyed when cultivated under suitable 
conditions, It will be promptly replaced upon notification with a 
viable spedment of the same culture. Availability of the deposits Is 
not be be construed as a license to practice the Invention 1n 



( 



( 



10 



15 



20 



25 



30 



0256843 

- 40 - 

contravention of the rights granted under the 
authority of any government in accordance with its 

patent laws. 

The invention includes the use of G-CSF as 

defined above or a mutein as defined above in 
preparing a medicament, e.g. in preparing a 
pharmaceutical or veterinary formulation. 

In another aspect the invention includes a 
pharmaceutical or veterinary formulation comprising 
G-CSF as defined above or a mutein as defined above 
formulated for pharmaceutical or veterinary use 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

The inventors have disclosed G-CSF and means 
for the efficient expression thereof. Variations 
within the scope of the invention may be made by 
those ordinarily skilled in the art without departing 
from the essence of the invention as claimed herein. 
The illustrative embodiments given hereinbefore are 
not limiting. 
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CLAIMS : 



1. A modified DNA sequence encoding G-CSF, said G-CSF 
having the effect of stimulating the production of primarily 
| granulocyte or granulocyte-macrophage colonies 1n colony forming assay 

i 

i 5 using bone marrow cell progenitors of an appropriate species, wherein 

i 

the 5' region thereof comprises deoxyriboadenine substituted for the 
deoxyriboguanlne or deoxyribocytosine in the third position of at 
least one codon wherein said substitution does not change the amino 
add sequence of G-CSF, 

10 2. The DNA sequence according to claim 1 wherein said 5 l 

region comprises a DNA sequence encoding the amino acid sequence Thr 
Pro Leu Gly or Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro. 

! 3. The DNA sequence according to claim 2 wherein at least 3 

codons have said substitution. 

i 

» • 

15 4. The DNA sequence according to claims 1, 2 or 3 further 

i 

comprising deoxyri bo thymidine substituted for deoxyribocytoslne in the 
first position of at least one codon. 

5. A modified DNA sequence encoding mature G-CSF wherein 
the 5* region thereof comprises codons selected from the group 

20 consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG ACA CCA TTA GGA and 
ATG ACA CCA TTA. 

6. A modified DNA sequence encoding a mutein of G-CSF, said 
mutein of G-CSF having the effect of stimulating the production of 
primarily granulocyte or granulocyte-macrophage colonies in colony 

25 forming assays using bone marrow cell progenitors of an appropriate 
species, wherein the mutein does not have an NH 2 -terminal methionine 
when produced from a recombinant host. 
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7. The modified DNA sequence encoding a mutein of G-CSF of 
claim 6 wherein the NH 2 -terminal end of said mutein encodes a sequence 
that is processed by met amino peptidase. 

8. The modified DNA sequence encoding a mutein of G-CSF of 
5 claim 7 wherein the DNA sequence encodes an amino terminal end 

selected from the group consisting of met-ala, met-gly, met-pro, met- 
al a-met, met-gly-met, met-ala-ser, met-ala-pro, met-pro- thr, and met- 
pro-leu. 

9. An expression vector comprising the DNA sequence of 
10 claims 1 to 8 in operable linkage with a promoter and ribosome 

binding site. 

10- The expression vector of claim 9 in operable linkage 
with a promoter-ribosome binding site selected from the group 
consisting of P L promoter and ribosome binding sites operable 
15 therewith and Trp promoter-ribosome binding site. 

11. An E. coli host transformed with the expression vector 
of claim 9 or claim lO. 

12. A culture of E. coli transformed with the expression 
vector of claim 10 having a G-CSF content of at least about 3% of 

20 total cell protein to about 10% of total cell protein. 

13. Human recombinant G-CSF • 

14. An mRNA fraction isolated from human cells that encodes 
human G-CSF. 

15. The expression vector of claim 10 comprising pJD4 or 

25 PPD5. 
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16. A muteln of G-CSF, said muteln of G-CSF having the 
effect of stimulating the production of primarily granulocyte or 
granulocyte-macrophage colonies In colony forming assays using bone 
marrow cell progenitors of an appropriate species wherein said muteln 
does not have an NH 2 -term1nal methionine. 

17. The muteln of claim 16 wherein the NH 2 -terminal end of 
said muteln has an amino add sequence that is processed by met amino 
peptidase 1n a recombinant host. 

18. The muteln of G-CSF of claim 17 wherein the amino 
terminal end 1s selected from the group consisting of met-ala, met- 
gly, met-pro, met-ala-met, met-gly-met, met-ala-ser, met-ala-pro, met- 
pro-thr and met-pro-leu. 

19. The muteln of G-CSF of claim 18 which yields Vj-mG-CSF 
when expressed In a recombinant host. 

20. A muteln of mG-CSF selected from the group consisting 
of alaj-mG-CSF, glyj-mG-CSF, prOi-mG-CSF, pro 1 -thr 2 -mG-CSF, ala 1 -met 2 - 
mG-CSF, gly 1 -met 2 -mG-CSF, alaj-serg-mG-CSF, ala^prog-mG-CSF, proj- 
leu 2 -mG-CSF and Vj^mG-CSF. 

21. Vj-mG-CSF. 
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22 A method of producing a modified DNA 
sequence encoding G-CSF, said G-CSF having the effect 
of stimulating the production of primarily 
granulocyte or granulocyte-macrophage colonies in 
colony forming assay using bone marrow cell 
progenitors of an appropriate species, which method 
comprises substituting deeoxyriboadenine at the 5' 
region of the sequence for the deoxyriboguanine or 
deoxyribocytosine in the third position of at least 
one codon wherein said substitution does not change 
the amino acid sequence of G-CSF. 

23. A method according to claim 22 and 
further defined by the specific feature(s) of any one 
or more of claims 2 to 4. 

24. A method of making a modified DNA 
sequence encoding mature G-CSF which includes 
selecting codons for the 5/ region thereof from the 
group consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG 
ACA CCA TTA GGA and ATG ACA CCA TTA. 

25 A method of producing a modified DNA 
sequence encoding a mutein of G-CSF, said mutein of 
G-CSF having the effect of stimulating the production 
of primarily granulocyte or granulocyte-macrophage 
colonies in colony forming assays using bone marrow 
cell progenitors of an appropriate species, wherein 
the mutein does not have an Nonterminal methionine 
when produced from a recombinant host, comprising 
modifying the NH -terminal sequence encoding 
portion of a DNA sequence encoding G-CSF, for example 
by subjecting cDNA encoding native G-CSF to site 
specific mutagenesis to produce an NH 2 -terminal 
sequence wherein the NH 2 -t rminal methionine is 
susceptible to enzymatic cleavage or ligating an 
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oligonucleotide encoding such an Nt^-terninal 
sequence to a specifically digested G-CSF DNA 
sequence . 

26. A method according to claim 25 and 
further defined by the specific feature of claim 7 or 
of claim 8. 

27. A process for preparing an expression 
vector which comprises forming an operable linkage 
between a promoter and ribosome binding site and a 
DNA sequence as defined in any one of claims 1 to 8, 
the promoter-ribosome binding site preferably being 
selected from the group consisting of P L promoter 
and ribosome binding sites operable therewith and Trp 
promoter-ribosome binding site. 

28. A method of making an E. coli cell 
capable of expressing G-CSF which comprises 
transforming an E. coli host with an expression 
vector as defined in claim 9, claim 10 or claim 15. 

29. A process for the preparation of 
recombinant G-CSF which comprises culturing an E. 
coli transformed host as defined in claim 11 or a 
culture as defined in claim 12 or E. coli cells made 
by the method of claim 28 so as to permit expression 
of the E. coli genome. 

30. A process according to claim 29 wherein 
the G-CSF prepared thereby is as defined in any one 
of claims 16 to 21. 

31. A pharmaceutical or veterinary 
formulation comprising G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 to 21 
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formulated for pharmaceutical or veterinary use 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

32. The use of G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 to 21 
in preparing a medicament, e.g. in preparing a 
pharmaceutical or veterinary formulation as defined 
in claim 31. 
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FIG. 4 

1 AAAAA ft r * GCCCGGAGCCTGCAGCCCAGCCCCACCCAGACCC ATGGCTGGA^CjrGCjCJ^CC 

MetAlaGlyProAlaThr 
-30 

61 CAGAGCCCCATGAAGCTGATGGCCCTGCAGCTGCTGCTGTGGCACAGTGCACTCTGGACA 
GTHTerProMetLysLeuMetAlaLeuGlnLeuLeuteuTrpHisSerAlaLeaTrpThr 

-20 ~ 10 

121 GTGCAGGAAGCCACCCCCCTGGGCCCTGCCAGCTCCCTGCCCCAGAGCTTCCTGCTCAAG 
ValGlnGluAlaThrProLeuGlyProAlaSerSerLeuProGlnSerPheLeuLeuLys 

-11 10 

1 81 TGCTTAGAGCAAGTGAGGAAGATCCAGGGCGATGGCGCAGCG'CTGCAGGAGJ^AGCTGT^T 
CysLeuGluGlnValArgLysIleGlnGlyAspGlyAlaAlaLeuGlnGluLysLeuCys 



20 



241 GCCACC^ACAAGCTGTGCCACCCCGAGGAGCTGGTGCTGCTCGGACACTCTCTGGGCATC 
AlaThrTyrLysLeuCysRisProGluGluLeuValLeuLeuGlyHisSerLeuGlylle 

40 50 

3 01 CCC^GGGCTCCCCTGAGCAGCTGCCCCAGCCAGGCCCTGCAGCTGGCAGGCTGCTTGAGC 
ProTrpAlaProLeuSerSerCysProSerGlnAlaLeuGlnLeuAlaGlyCysLeuSer 
60 ~ 70 

361 CAACTCCATAGCGGCCTTTTCCTCTACCAGGGGCTCCTGCAGGCCCTGGAAGGGATCTCC 
GlnLeuHisSerGlyLeuPheLeuTyrGlnGlyLeuLeuGlnAlaLeuGluGlylleSer 

80 90 

421 CCCGAGTTGGGTCCCACCTTGGACACACTGCAGCTGGACGTCGCCGACTTTGCCACCACC 
ProGluLeuGlyProThrLeuAspThrLeuGlnLeuAspValAlaAspPheAlaThrThr 

100 110 

4 81 ATCTGGCAGCAGATGGAAGAACTGGGAATGGCCCCTGCCCTGCAGCCCACCCAGGGTGCC 
IleTrpGlnGlnMetGluGluLeuGlyMetAlaProAlaLeuGlnProThrGlnGlyAla 
120 130 

541 ATGCCGGCCTTCGCCTCTGCTTTCCAGCGCCGGGCAGGAGGGGTCCTAGTTGCCTCCCAT 
HetProAlaPheAlaSecAlaPheGlnArgArgAlaGlyGlyValLeuValAlaSerHis 
140 ISO 

6 01 CTGCAGAGCTTCCTGGAGGTGTCGTACCGCGTTCTACGCCACCTTGCCCAGCCCTGAGCC 

LeuGlnSerPheLeuGluValSerTyrArgValLeuArgHisLeuAlaGlnPco 

160 170 
661 AAGCCCTCCCCATCCCATGTATTTATCTCTATTTAATATTTATGTCTATTTAAGCCTCAT 

7 21 ATTTAAAGACAGGGAAGAGCAGAACGGAGCCCCAGGCCTCTGTGTCCTTCCCTGCATTTC 
7 81 TGAGTTTCATTCTCCTGCCTGTAGCAGTGAGAAAAAGCTCCTGTCCTCCCATCCCCTGGA 
841 CTGGGAGGTAGATAGGTAAATACCAAGTATTTATTACTATGACTGCTCCCCAGCCCTGGC 
901 TCTGCAATGGGCACTGGGATGAGCCGCTGTGAGCCCCTGGTCCTGAGGGTCCCCACCTGG 
961 GACCCTTGAGAGTATCAGGTCTCCCACGTGGGAGACAAGAAATCCCTGTTTAATATTTAA 

10 21 ACAGCAGTGTTCCCCATCTGGGTCCTTGCACCCCTCACTCTGGCCTCAGCCGACTGCACA 
10 81 GCGGCCCCTGCATCCCCTTGGCTGTGAGGCCCCTGGACAAGCAGAGGTGGCCAGAGCTGG 
1141 GAGGCATGGCCCTGGGGTCCCACGAATTTGCTGGGGAATCTCGTTTTTCTTCTT AAGACT 
1201 TTTGGGACATGGTTTGACTCCCGAACATCACCGACGTGTCTCCTGTTTTTCTGGGTGGCC 
1261 TCGGGACACCTGCCCTGCCCCCACGAGGGTCAGGACTGTGACTCTTTTTAGGGCCAGGCA 
13 21 GGTGCCTGGACATTTGCCTTGCTGGACGGGGACTGGGGATGTGGGAGGGAGCAGACAGGA 

13 81 GGAATCATGTCAGGCCTGTGTGTGAAAGGAAGCTCCACTGTCACCCTCCACCTCTTCACC 

14 41 CCCCACTCACCAGTGTCCCCTCCACTGTCACATTGTAACTGAACTTCAGGA TAAJTAAAG T 
1501 GTTTGCCTCCAAAAAAAAAAAAAAA 
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